Resolving Discourse-Deictic Pronouns: A Two-Stage Approach to Do It

نویسندگان

  • Sujay Kumar Jauhar
  • Raul Guerra
  • Edgar Gonzàlez Pellicer
  • Marta Recasens
چکیده

Discourse deixis is a linguistic phenomenon in which pronouns have verbal or clausal, rather than nominal, antecedents. Studies have estimated that between 5% and 10% of pronouns in non-conversational data are discourse deictic. However, current coreference resolution systems ignore this phenomenon. This paper presents an automatic system for the detection and resolution of discourse-deictic pronouns. We introduce a two-step approach that first recognizes instances of discourse-deictic pronouns, and then resolves them to their verbal antecedent. Both components rely on linguistically motivated features. We evaluate the components in isolation and in combination with two state-of-the-art coreference resolvers. Results show that our system outperforms several baselines, including the only comparable discourse deixis system, and leads to small but statistically significant improvements over the full coreference resolution systems. An error analysis lays bare the need for a less strict evaluation of this task.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Resolving Discourse Deictic Anaphora in Dialogues

Most existing anaphora resolution algorithms are designed to account only for anaphors with NP-antecedents. This paper describes an algorithm for the resolution of discourse deictic anaphors, which constitute a large percentage of anaphors in spoken dialogues. The success of the resolution is dependent on the classification of all pronouns and demonstratives into individual, discourse deictic a...

متن کامل

Deictic Reference and Discourse Structure

Research on the factors and processes involved in pronoun interpretation has to date concentrated on anaphoric pronouns. Results have supported the now widely-held view that discourse understanding involves the creation of a partial, mental model of the situation described through the discourse. Anaphoric pronouns are taken to refer to elements of that model (often called discourse referents or...

متن کامل

Resolving Discourse Deictic Anaphors in Tutorial Dialogues

Most of the anaphoric resolution algorithms developed so far focus on anaphors with NP antecedents, be it inter-sentential or intrasentential. The main focus of this paper is to resolve various other types of anaphors such as discourse deictic anaphors found in computermediated tutorial dialogues on physics. We do this first through a corpus-based study of physics tutoring dialogues. Our approa...

متن کامل

Dialogue Acts, Synchronizing Units, and Anaphora Resolution

In this paper, we present the results of a corpus analysis, and a model of anaphora resolution in spontaneous spoken dialogues. The main finding of our corpus analysis is that less than half the pronouns and demonstratives have NP antecedents in the preceding text; 22% have sentential antecedents and the remainder have no identifiable linguistic antecedents. As part of the corpus analysis we pr...

متن کامل

Identifying zero pronouns in Japanese dialogue

Japanese dialogue containing zero pronouns is analyzed for the purpose of automatic Japanese-Engl ish conversation translation. Topic-driven Discourse Structure is formalized which ident i f ies ma in ly non -human zero pronouns as a by-product. Other zero pronouns are handled us ing cogni t ive and soc io l ingu i s t i c i n fo rma t ion in honorific, deictic, speech-act and mental predicates...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015